Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 1427854 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 197.0 MiB |
| Average record size in memory | 144.7 B |
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 9 |
| DateTime | 1 |
BoolBridle has constant value "0" | Constant |
Town has a high cardinality: 1960 distinct values | High cardinality |
No_Incidents is highly overall correlated with Risk_S*I/Inspections and 3 other fields | High correlation |
Risk_S*I/Inspections is highly overall correlated with No_Incidents and 4 other fields | High correlation |
leakage_estimate_factor is highly overall correlated with No_Incidents and 2 other fields | High correlation |
Risk_S*I is highly overall correlated with No_Incidents and 3 other fields | High correlation |
Length is highly overall correlated with NumConnections | High correlation |
NumConnections is highly overall correlated with Length | High correlation |
Severity is highly overall correlated with Risk_S*I/Inspections and 1 other fields | High correlation |
Incidence is highly overall correlated with No_Incidents and 3 other fields | High correlation |
Severity is highly imbalanced (98.8%) | Imbalance |
Incidence is highly imbalanced (97.9%) | Imbalance |
Material is highly imbalanced (77.7%) | Imbalance |
NumConnectionsUnder is highly imbalanced (99.8%) | Imbalance |
gas_natural is highly imbalanced (76.0%) | Imbalance |
leakage_estimate_factor is highly skewed (γ1 = 31.24044418) | Skewed |
Length is highly skewed (γ1 = 61.11727142) | Skewed |
PipeId has unique values | Unique |
No_Incidents has 1416383 (99.2%) zeros | Zeros |
Risk_S*I/Inspections has 1416383 (99.2%) zeros | Zeros |
leakage_estimate_factor has 1416432 (99.2%) zeros | Zeros |
Risk_S*I has 1416383 (99.2%) zeros | Zeros |
NumConnections has 885989 (62.1%) zeros | Zeros |
Reproduction
| Analysis started | 2023-02-12 23:35:27.814406 |
|---|---|
| Analysis finished | 2023-02-12 23:37:20.027077 |
| Duration | 1 minute and 52.21 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
PipeId
Real number (ℝ)
| Distinct | 1427854 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8729451 × 108 |
| Minimum | 489616 |
|---|---|
| Maximum | 4.5199531 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 489616 |
|---|---|
| 5-th percentile | 9762584.2 |
| Q1 | 52652587 |
| median | 1.8972642 × 108 |
| Q3 | 2.9082495 × 108 |
| 95-th percentile | 3.9839603 × 108 |
| Maximum | 4.5199531 × 108 |
| Range | 4.5150569 × 108 |
| Interquartile range (IQR) | 2.3817236 × 108 |
Descriptive statistics
| Standard deviation | 1.2080573 × 108 |
|---|---|
| Coefficient of variation (CV) | 0.64500412 |
| Kurtosis | -0.95896068 |
| Mean | 1.8729451 × 108 |
| Median Absolute Deviation (MAD) | 1.0947898 × 108 |
| Skewness | 0.084257286 |
| Sum | 2.6742922 × 1014 |
| Variance | 1.4594025 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 56922465 | 1 | < 0.1% |
| 188184697 | 1 | < 0.1% |
| 308827198 | 1 | < 0.1% |
| 188180179 | 1 | < 0.1% |
| 51286512 | 1 | < 0.1% |
| 7941903 | 1 | < 0.1% |
| 188184880 | 1 | < 0.1% |
| 190623079 | 1 | < 0.1% |
| 31530581 | 1 | < 0.1% |
| 132954227 | 1 | < 0.1% |
| Other values (1427844) | 1427844 |
| Value | Count | Frequency (%) |
| 489616 | 1 | |
| 489645 | 1 | |
| 489646 | 1 | |
| 489780 | 1 | |
| 489790 | 1 | |
| 489792 | 1 | |
| 489793 | 1 | |
| 489981 | 1 | |
| 489982 | 1 | |
| 489996 | 1 |
| Value | Count | Frequency (%) |
| 451995309 | 1 | |
| 451995260 | 1 | |
| 451995254 | 1 | |
| 451195580 | 1 | |
| 451195430 | 1 | |
| 451195406 | 1 | |
| 451195391 | 1 | |
| 451195364 | 1 | |
| 451195284 | 1 | |
| 451195194 | 1 |
Inspections
Real number (ℝ)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4439726 |
| Minimum | 1 |
|---|---|
| Maximum | 11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.3786941 |
|---|---|
| Coefficient of variation (CV) | 0.31023912 |
| Kurtosis | 0.80788533 |
| Mean | 4.4439726 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.89492045 |
| Sum | 6345344 |
| Variance | 1.9007975 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 781700 | |
| 6 | 196586 | 13.8% |
| 2 | 134877 | 9.4% |
| 4 | 124383 | 8.7% |
| 3 | 116200 | 8.1% |
| 1 | 64846 | 4.5% |
| 7 | 4642 | 0.3% |
| 10 | 2216 | 0.2% |
| 8 | 938 | 0.1% |
| 9 | 844 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 64846 | 4.5% |
| 2 | 134877 | 9.4% |
| 3 | 116200 | 8.1% |
| 4 | 124383 | 8.7% |
| 5 | 781700 | |
| 6 | 196586 | 13.8% |
| 7 | 4642 | 0.3% |
| 8 | 938 | 0.1% |
| 9 | 844 | 0.1% |
| 10 | 2216 | 0.2% |
| Value | Count | Frequency (%) |
| 11 | 622 | < 0.1% |
| 10 | 2216 | 0.2% |
| 9 | 844 | 0.1% |
| 8 | 938 | 0.1% |
| 7 | 4642 | 0.3% |
| 6 | 196586 | 13.8% |
| 5 | 781700 | |
| 4 | 124383 | 8.7% |
| 3 | 116200 | 8.1% |
| 2 | 134877 | 9.4% |
No_Incidents
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.008633936 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 1416383 |
| Zeros (%) | 99.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.099314918 |
|---|---|
| Coefficient of variation (CV) | 11.502856 |
| Kurtosis | 197.85137 |
| Mean | 0.008633936 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.897846 |
| Sum | 12328 |
| Variance | 0.0098634529 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1416383 | |
| 1 | 10681 | 0.7% |
| 2 | 729 | 0.1% |
| 3 | 56 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1416383 | |
| 1 | 10681 | 0.7% |
| 2 | 729 | 0.1% |
| 3 | 56 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 4 | 4 | < 0.1% |
| 3 | 56 | < 0.1% |
| 2 | 729 | 0.1% |
| 1 | 10681 | 0.7% |
| 0 | 1416383 |
Risk_S*I/Inspections
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 56 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0089574506 |
| Minimum | 0 |
|---|---|
| Maximum | 3 |
| Zeros | 1416383 |
| Zeros (%) | 99.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 3 |
| Range | 3 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.11525577 |
|---|---|
| Coefficient of variation (CV) | 12.867029 |
| Kurtosis | 338.33746 |
| Mean | 0.0089574506 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.808451 |
| Sum | 12789.932 |
| Variance | 0.013283893 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1416383 | |
| 0.7599999905 | 3019 | 0.2% |
| 0.6388888955 | 1603 | 0.1% |
| 1.75 | 1562 | 0.1% |
| 0.7200000286 | 953 | 0.1% |
| 1.222222209 | 878 | 0.1% |
| 3 | 757 | 0.1% |
| 0.6800000072 | 471 | < 0.1% |
| 0.9375 | 406 | < 0.1% |
| 0.6111111045 | 277 | < 0.1% |
| Other values (46) | 1545 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1416383 | |
| 0.3700000048 | 4 | < 0.1% |
| 0.3799999952 | 2 | < 0.1% |
| 0.3899999857 | 17 | < 0.1% |
| 0.407407403 | 1 | < 0.1% |
| 0.4197530746 | 1 | < 0.1% |
| 0.4320987761 | 3 | < 0.1% |
| 0.453125 | 1 | < 0.1% |
| 0.46875 | 2 | < 0.1% |
| 0.484375 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 3 | 757 | |
| 2.666666746 | 2 | < 0.1% |
| 2.5 | 12 | < 0.1% |
| 2.4375 | 6 | < 0.1% |
| 2.222222328 | 64 | < 0.1% |
| 2.039999962 | 13 | < 0.1% |
| 2 | 146 | < 0.1% |
| 1.919999957 | 5 | < 0.1% |
| 1.799999952 | 2 | < 0.1% |
| 1.777777791 | 4 | < 0.1% |
leakage_estimate_factor
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 393 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.10518317 |
| Minimum | 0 |
|---|---|
| Maximum | 198 |
| Zeros | 1416432 |
| Zeros (%) | 99.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 198 |
| Range | 198 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.5055693 |
|---|---|
| Coefficient of variation (CV) | 14.313786 |
| Kurtosis | 1935.8081 |
| Mean | 0.10518317 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 31.240444 |
| Sum | 150186.21 |
| Variance | 2.2667391 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1416432 | |
| 9.119999886 | 1820 | 0.1% |
| 7.027777672 | 722 | 0.1% |
| 8.640000343 | 699 | < 0.1% |
| 21 | 648 | < 0.1% |
| 8.739999771 | 397 | < 0.1% |
| 8.359999657 | 367 | < 0.1% |
| 9.5 | 343 | < 0.1% |
| 7.347222328 | 314 | < 0.1% |
| 7.666666508 | 297 | < 0.1% |
| Other values (383) | 5815 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 1416432 | |
| 0.5 | 1 | < 0.1% |
| 0.5699999928 | 1 | < 0.1% |
| 0.5849999785 | 1 | < 0.1% |
| 0.9750000238 | 1 | < 0.1% |
| 1 | 7 | < 0.1% |
| 1.019999981 | 1 | < 0.1% |
| 1.2109375 | 1 | < 0.1% |
| 1.458333373 | 1 | < 0.1% |
| 1.5 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 198 | 1 | < 0.1% |
| 180 | 1 | < 0.1% |
| 165 | 2 | |
| 163.5 | 2 | |
| 162 | 1 | < 0.1% |
| 144 | 3 | |
| 141 | 1 | < 0.1% |
| 133.5 | 1 | < 0.1% |
| 127.5 | 1 | < 0.1% |
| 125 | 1 | < 0.1% |
InspectionDay
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.8 MiB |
| Tuesday | |
|---|---|
| Wednesday | |
| Monday | |
| Thursday | |
| Friday | |
| Other values (2) |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.2592821 |
| Min length | 6 |
Characters and Unicode
| Total characters | 10365195 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Thursday |
|---|---|
| 2nd row | Thursday |
| 3rd row | Thursday |
| 4th row | Thursday |
| 5th row | Thursday |
Common Values
| Value | Count | Frequency (%) |
| Tuesday | 292773 | |
| Wednesday | 286392 | |
| Monday | 285921 | |
| Thursday | 281702 | |
| Friday | 218370 | |
| Saturday | 41359 | 2.9% |
| Sunday | 21337 | 1.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| tuesday | 292773 | |
| wednesday | 286392 | |
| monday | 285921 | |
| thursday | 281702 | |
| friday | 218370 | |
| saturday | 41359 | 2.9% |
| sunday | 21337 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 1714246 | |
| a | 1469213 | |
| y | 1427854 | |
| e | 865557 | |
| s | 860867 | |
| u | 637171 | 6.1% |
| n | 593650 | 5.7% |
| T | 574475 | 5.5% |
| r | 541431 | 5.2% |
| W | 286392 | 2.8% |
| Other values (7) | 1394339 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8937341 | |
| Uppercase Letter | 1427854 | 13.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 1714246 | |
| a | 1469213 | |
| y | 1427854 | |
| e | 865557 | |
| s | 860867 | |
| u | 637171 | 7.1% |
| n | 593650 | 6.6% |
| r | 541431 | 6.1% |
| o | 285921 | 3.2% |
| h | 281702 | 3.2% |
| Other values (2) | 259729 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 574475 | |
| W | 286392 | |
| M | 285921 | |
| F | 218370 | 15.3% |
| S | 62696 | 4.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10365195 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| d | 1714246 | |
| a | 1469213 | |
| y | 1427854 | |
| e | 865557 | |
| s | 860867 | |
| u | 637171 | 6.1% |
| n | 593650 | 5.7% |
| T | 574475 | 5.5% |
| r | 541431 | 5.2% |
| W | 286392 | 2.8% |
| Other values (7) | 1394339 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10365195 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| d | 1714246 | |
| a | 1469213 | |
| y | 1427854 | |
| e | 865557 | |
| s | 860867 | |
| u | 637171 | 6.1% |
| n | 593650 | 5.7% |
| T | 574475 | 5.5% |
| r | 541431 | 5.2% |
| W | 286392 | 2.8% |
| Other values (7) | 1394339 |
InspectionYear
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2019.2919 |
| Minimum | 2010 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 2010 |
|---|---|
| 5-th percentile | 2018 |
| Q1 | 2019 |
| median | 2019 |
| Q3 | 2020 |
| 95-th percentile | 2020 |
| Maximum | 2021 |
| Range | 11 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.0062237 |
|---|---|
| Coefficient of variation (CV) | 0.00049830519 |
| Kurtosis | 17.58833 |
| Mean | 2019.2919 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -3.4349364 |
| Sum | 2.8832541 × 109 |
| Variance | 1.012486 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2019 | 674030 | |
| 2020 | 656873 | |
| 2018 | 32287 | 2.3% |
| 2017 | 30592 | 2.1% |
| 2016 | 9895 | 0.7% |
| 2015 | 7461 | 0.5% |
| 2013 | 6177 | 0.4% |
| 2014 | 5926 | 0.4% |
| 2012 | 2166 | 0.2% |
| 2021 | 1445 | 0.1% |
| Other values (2) | 1002 | 0.1% |
| Value | Count | Frequency (%) |
| 2010 | 50 | < 0.1% |
| 2011 | 952 | 0.1% |
| 2012 | 2166 | 0.2% |
| 2013 | 6177 | 0.4% |
| 2014 | 5926 | 0.4% |
| 2015 | 7461 | 0.5% |
| 2016 | 9895 | 0.7% |
| 2017 | 30592 | 2.1% |
| 2018 | 32287 | 2.3% |
| 2019 | 674030 |
| Value | Count | Frequency (%) |
| 2021 | 1445 | 0.1% |
| 2020 | 656873 | |
| 2019 | 674030 | |
| 2018 | 32287 | 2.3% |
| 2017 | 30592 | 2.1% |
| 2016 | 9895 | 0.7% |
| 2015 | 7461 | 0.5% |
| 2014 | 5926 | 0.4% |
| 2013 | 6177 | 0.4% |
| 2012 | 2166 | 0.2% |
InspectionDate
Date
| Distinct | 3041 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.8 MiB |
| Minimum | 2010-10-01 00:00:00 |
|---|---|
| Maximum | 2020-12-31 00:00:00 |
MonthsLastRev
Real number (ℝ)
| Distinct | 128 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.74752 |
| Minimum | 0 |
|---|---|
| Maximum | 132 |
| Zeros | 727 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 24 |
| median | 24 |
| Q3 | 24 |
| 95-th percentile | 26 |
| Maximum | 132 |
| Range | 132 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 6.252464 |
|---|---|
| Coefficient of variation (CV) | 0.25265013 |
| Kurtosis | 40.644295 |
| Mean | 24.74752 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.1776128 |
| Sum | 35335845 |
| Variance | 39.093306 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 884791 | |
| 25 | 154980 | 10.9% |
| 23 | 152541 | 10.7% |
| 22 | 87421 | 6.1% |
| 48 | 29299 | 2.1% |
| 21 | 26477 | 1.9% |
| 26 | 18466 | 1.3% |
| 49 | 6819 | 0.5% |
| 47 | 4902 | 0.3% |
| 20 | 4253 | 0.3% |
| Other values (118) | 57905 | 4.1% |
| Value | Count | Frequency (%) |
| 0 | 727 | |
| 1 | 113 | < 0.1% |
| 2 | 370 | < 0.1% |
| 3 | 769 | |
| 4 | 716 | |
| 5 | 696 | |
| 6 | 665 | |
| 7 | 919 | |
| 8 | 961 | |
| 9 | 1174 |
| Value | Count | Frequency (%) |
| 132 | 1 | < 0.1% |
| 131 | 1 | < 0.1% |
| 130 | 2 | < 0.1% |
| 128 | 2 | < 0.1% |
| 125 | 3 | < 0.1% |
| 122 | 4 | < 0.1% |
| 121 | 17 | |
| 120 | 33 | |
| 119 | 12 | < 0.1% |
| 118 | 3 | < 0.1% |
Risk_S*I
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 57 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.030705905 |
| Minimum | 0 |
|---|---|
| Maximum | 15 |
| Zeros | 1416383 |
| Zeros (%) | 99.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 15 |
| Range | 15 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.35262242 |
|---|---|
| Coefficient of variation (CV) | 11.483864 |
| Kurtosis | 180.5955 |
| Mean | 0.030705905 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.585851 |
| Sum | 43843.549 |
| Variance | 0.12434257 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1416383 | |
| 3.799999952 | 3020 | 0.2% |
| 3.5 | 1811 | 0.1% |
| 3.833333254 | 1603 | 0.1% |
| 3.666666746 | 997 | 0.1% |
| 3.599999905 | 952 | 0.1% |
| 3 | 848 | 0.1% |
| 3.400000095 | 471 | < 0.1% |
| 3.75 | 408 | < 0.1% |
| 7.333333492 | 159 | < 0.1% |
| Other values (47) | 1202 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1416383 | |
| 1 | 81 | < 0.1% |
| 2 | 140 | < 0.1% |
| 2.5 | 50 | < 0.1% |
| 3 | 848 | 0.1% |
| 3.25 | 45 | < 0.1% |
| 3.333333254 | 108 | < 0.1% |
| 3.400000095 | 471 | < 0.1% |
| 3.5 | 1811 | 0.1% |
| 3.571428537 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 15 | 1 | < 0.1% |
| 13.33333302 | 2 | < 0.1% |
| 12 | 1 | < 0.1% |
| 10.71428585 | 1 | < 0.1% |
| 10.66666698 | 1 | < 0.1% |
| 10.5 | 11 | |
| 10.19999981 | 13 | |
| 10 | 3 | < 0.1% |
| 9.75 | 6 | |
| 9.600000381 | 5 | < 0.1% |
Severity
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.8 MiB |
| 4 | |
|---|---|
| 3 | 2179 |
| 2 | 545 |
| 1 | 207 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1427854 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 4 |
| 3rd row | 4 |
| 4th row | 4 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 4 | 1424923 | |
| 3 | 2179 | 0.2% |
| 2 | 545 | < 0.1% |
| 1 | 207 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 4 | 1424923 | |
| 3 | 2179 | 0.2% |
| 2 | 545 | < 0.1% |
| 1 | 207 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 1424923 | |
| 3 | 2179 | 0.2% |
| 2 | 545 | < 0.1% |
| 1 | 207 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1427854 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1424923 | |
| 3 | 2179 | 0.2% |
| 2 | 545 | < 0.1% |
| 1 | 207 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1427854 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 1424923 | |
| 3 | 2179 | 0.2% |
| 2 | 545 | < 0.1% |
| 1 | 207 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1427854 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 1424923 | |
| 3 | 2179 | 0.2% |
| 2 | 545 | < 0.1% |
| 1 | 207 | < 0.1% |
Incidence
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.8 MiB |
| 0 | |
|---|---|
| 1 | 2931 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1427854 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1424923 | |
| 1 | 2931 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 1424923 | |
| 1 | 2931 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1424923 | |
| 1 | 2931 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1427854 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1424923 | |
| 1 | 2931 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1427854 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1424923 | |
| 1 | 2931 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1427854 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1424923 | |
| 1 | 2931 | 0.2% |
Province
Categorical
| Distinct | 38 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.8 MiB |
| Barcelona | |
|---|---|
| Valencia | |
| Madrid | |
| Girona | |
| Tarragona | |
| Other values (33) |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 7.8627409 |
| Min length | 4 |
Characters and Unicode
| Total characters | 11226846 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Valencia |
|---|---|
| 2nd row | Barcelona |
| 3rd row | Valencia |
| 4th row | Valencia |
| 5th row | Barcelona |
Common Values
| Value | Count | Frequency (%) |
| Barcelona | 383967 | |
| Valencia | 136075 | 9.5% |
| Madrid | 119126 | 8.3% |
| Girona | 85030 | 6.0% |
| Tarragona | 75613 | 5.3% |
| Alicante | 63681 | 4.5% |
| La Coruña | 44670 | 3.1% |
| Sevilla | 43903 | 3.1% |
| Toledo | 39207 | 2.7% |
| Pontevedra | 38736 | 2.7% |
| Other values (28) | 397846 |
Length
| Value | Count | Frequency (%) |
| barcelona | 383967 | |
| valencia | 136075 | 9.0% |
| madrid | 119126 | 7.8% |
| girona | 85030 | 5.6% |
| tarragona | 75613 | 5.0% |
| la | 66409 | 4.4% |
| alicante | 63681 | 4.2% |
| coruña | 44670 | 2.9% |
| sevilla | 43903 | 2.9% |
| toledo | 39207 | 2.6% |
| Other values (30) | 461228 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2322089 | |
| l | 1053570 | |
| e | 945016 | |
| r | 921676 | 8.2% |
| n | 904191 | 8.1% |
| o | 837574 | 7.5% |
| c | 631892 | 5.6% |
| i | 601135 | 5.4% |
| d | 535799 | 4.8% |
| B | 401733 | 3.6% |
| Other values (28) | 2072171 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9616882 | |
| Uppercase Letter | 1518909 | 13.5% |
| Space Separator | 91055 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2322089 | |
| l | 1053570 | |
| e | 945016 | |
| r | 921676 | 9.6% |
| n | 904191 | 9.4% |
| o | 837574 | 8.7% |
| c | 631892 | 6.6% |
| i | 601135 | 6.3% |
| d | 535799 | 5.6% |
| t | 149494 | 1.6% |
| Other values (12) | 714446 | 7.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 401733 | |
| V | 173339 | |
| M | 150955 | 9.9% |
| L | 138589 | 9.1% |
| C | 135862 | 8.9% |
| T | 115659 | 7.6% |
| G | 115363 | 7.6% |
| A | 80422 | 5.3% |
| S | 72373 | 4.8% |
| P | 47901 | 3.2% |
| Other values (5) | 86713 | 5.7% |
Space Separator
| Value | Count | Frequency (%) |
| 91055 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11135791 | |
| Common | 91055 | 0.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2322089 | |
| l | 1053570 | |
| e | 945016 | |
| r | 921676 | 8.3% |
| n | 904191 | 8.1% |
| o | 837574 | 7.5% |
| c | 631892 | 5.7% |
| i | 601135 | 5.4% |
| d | 535799 | 4.8% |
| B | 401733 | 3.6% |
| Other values (27) | 1981116 |
Common
| Value | Count | Frequency (%) |
| 91055 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11060112 | |
| None | 166734 | 1.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2322089 | |
| l | 1053570 | |
| e | 945016 | |
| r | 921676 | 8.3% |
| n | 904191 | 8.2% |
| o | 837574 | 7.6% |
| c | 631892 | 5.7% |
| i | 601135 | 5.4% |
| d | 535799 | 4.8% |
| B | 401733 | 3.6% |
| Other values (24) | 1905437 |
None
| Value | Count | Frequency (%) |
| ó | 76599 | |
| ñ | 44670 | |
| á | 39472 | |
| é | 5993 | 3.6% |
Town
Categorical
| Distinct | 1960 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.8 MiB |
| Madrid | 75616 |
|---|---|
| Barcelona | 58408 |
| Valencia | 25405 |
| Sevilla | 22491 |
| Terrassa | 16573 |
| Other values (1955) |
Length
| Max length | 25 |
|---|---|
| Median length | 22 |
| Mean length | 10.504441 |
| Min length | 3 |
Characters and Unicode
| Total characters | 14998808 |
|---|---|
| Distinct characters | 65 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 92 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Betera |
|---|---|
| 2nd row | Sabadell |
| 3rd row | Betera |
| 4th row | Betera |
| 5th row | Sabadell |
Common Values
| Value | Count | Frequency (%) |
| Madrid | 75616 | 5.3% |
| Barcelona | 58408 | 4.1% |
| Valencia | 25405 | 1.8% |
| Sevilla | 22491 | 1.6% |
| Terrassa | 16573 | 1.2% |
| Málaga | 16363 | 1.1% |
| Sabadell | 15887 | 1.1% |
| Vigo | 14213 | 1.0% |
| Alicante/Alacant | 13899 | 1.0% |
| Valladolid | 13369 | 0.9% |
| Other values (1950) | 1155630 |
Length
| Value | Count | Frequency (%) |
| de | 223555 | 9.8% |
| madrid | 75616 | 3.3% |
| del | 75467 | 3.3% |
| sant | 61713 | 2.7% |
| la | 58728 | 2.6% |
| barcelona | 58408 | 2.6% |
| valles | 31698 | 1.4% |
| llobregat | 26697 | 1.2% |
| valencia | 26038 | 1.1% |
| sevilla | 22491 | 1.0% |
| Other values (2139) | 1627069 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2289631 | |
| e | 1431172 | 9.5% |
| l | 1372557 | 9.2% |
| r | 1012677 | 6.8% |
| 859641 | 5.7% | |
| o | 844774 | 5.6% |
| n | 756321 | 5.0% |
| d | 731066 | 4.9% |
| i | 697670 | 4.7% |
| s | 558622 | 3.7% |
| Other values (55) | 4444677 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11993684 | |
| Uppercase Letter | 2043798 | 13.6% |
| Space Separator | 859641 | 5.7% |
| Other Punctuation | 79065 | 0.5% |
| Dash Punctuation | 22612 | 0.2% |
| Decimal Number | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2289631 | |
| e | 1431172 | |
| l | 1372557 | |
| r | 1012677 | |
| o | 844774 | 7.0% |
| n | 756321 | 6.3% |
| d | 731066 | 6.1% |
| i | 697670 | 5.8% |
| s | 558622 | 4.7% |
| t | 501573 | 4.2% |
| Other values (23) | 1797621 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 245922 | |
| S | 227354 | |
| C | 227219 | |
| V | 201057 | |
| A | 178438 | |
| B | 163200 | |
| P | 122254 | 6.0% |
| L | 110181 | 5.4% |
| R | 93596 | 4.6% |
| T | 89573 | 4.4% |
| Other values (16) | 385004 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 50954 | |
| ' | 26016 | |
| . | 2095 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 859641 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 22612 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14037482 | |
| Common | 961326 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2289631 | |
| e | 1431172 | 10.2% |
| l | 1372557 | 9.8% |
| r | 1012677 | 7.2% |
| o | 844774 | 6.0% |
| n | 756321 | 5.4% |
| d | 731066 | 5.2% |
| i | 697670 | 5.0% |
| s | 558622 | 4.0% |
| t | 501573 | 3.6% |
| Other values (49) | 3841419 |
Common
| Value | Count | Frequency (%) |
| 859641 | ||
| / | 50954 | 5.3% |
| ' | 26016 | 2.7% |
| - | 22612 | 2.4% |
| . | 2095 | 0.2% |
| 7 | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14851894 | |
| None | 146914 | 1.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2289631 | |
| e | 1431172 | 9.6% |
| l | 1372557 | 9.2% |
| r | 1012677 | 6.8% |
| 859641 | 5.8% | |
| o | 844774 | 5.7% |
| n | 756321 | 5.1% |
| d | 731066 | 4.9% |
| i | 697670 | 4.7% |
| s | 558622 | 3.8% |
| Other values (44) | 4297763 |
None
| Value | Count | Frequency (%) |
| ñ | 41082 | |
| ó | 30988 | |
| á | 30483 | |
| í | 11558 | 7.9% |
| é | 11267 | 7.7% |
| ç | 6777 | 4.6% |
| à | 6129 | 4.2% |
| è | 5464 | 3.7% |
| ú | 2676 | 1.8% |
| Á | 304 | 0.2% |
YearBuilt
Real number (ℝ)
| Distinct | 95 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2002.3953 |
| Minimum | 1901 |
|---|---|
| Maximum | 2050 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 1901 |
|---|---|
| 5-th percentile | 1986 |
| Q1 | 1998 |
| median | 2004 |
| Q3 | 2009 |
| 95-th percentile | 2016 |
| Maximum | 2050 |
| Range | 149 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 12.063572 |
|---|---|
| Coefficient of variation (CV) | 0.0060245706 |
| Kurtosis | 22.494039 |
| Mean | 2002.3953 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -3.4324023 |
| Sum | 2.8591282 × 109 |
| Variance | 145.52977 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2002 | 86257 | 6.0% |
| 2004 | 75048 | 5.3% |
| 2003 | 68950 | 4.8% |
| 2005 | 67861 | 4.8% |
| 2001 | 67333 | 4.7% |
| 2008 | 66563 | 4.7% |
| 2006 | 64662 | 4.5% |
| 2007 | 62125 | 4.4% |
| 2016 | 56975 | 4.0% |
| 2009 | 56945 | 4.0% |
| Other values (85) | 755135 |
| Value | Count | Frequency (%) |
| 1901 | 6068 | |
| 1912 | 2 | < 0.1% |
| 1914 | 2 | < 0.1% |
| 1920 | 1 | < 0.1% |
| 1923 | 1 | < 0.1% |
| 1925 | 2 | < 0.1% |
| 1926 | 1 | < 0.1% |
| 1927 | 2 | < 0.1% |
| 1928 | 5 | < 0.1% |
| 1929 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 2050 | 29 | < 0.1% |
| 2022 | 21 | < 0.1% |
| 2021 | 230 | < 0.1% |
| 2020 | 2109 | 0.1% |
| 2019 | 6192 | 0.4% |
| 2018 | 14101 | 1.0% |
| 2017 | 19957 | 1.4% |
| 2016 | 56975 | |
| 2015 | 52253 | |
| 2014 | 36523 |
Material
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.8 MiB |
| PE | |
|---|---|
| AO | |
| FD | 44305 |
| PN | 15251 |
| CU | 6624 |
| Other values (6) | 2441 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2855708 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PE |
|---|---|
| 2nd row | PE |
| 3rd row | AO |
| 4th row | AO |
| 5th row | PE |
Common Values
| Value | Count | Frequency (%) |
| PE | 1232020 | |
| AO | 127213 | 8.9% |
| FD | 44305 | 3.1% |
| PN | 15251 | 1.1% |
| CU | 6624 | 0.5% |
| ZD | 2377 | 0.2% |
| FG | 22 | < 0.1% |
| PA | 13 | < 0.1% |
| FI | 13 | < 0.1% |
| PV | 12 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| pe | 1232020 | |
| ao | 127213 | 8.9% |
| fd | 44305 | 3.1% |
| pn | 15251 | 1.1% |
| cu | 6624 | 0.5% |
| zd | 2377 | 0.2% |
| fg | 22 | < 0.1% |
| pa | 13 | < 0.1% |
| fi | 13 | < 0.1% |
| pv | 12 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 1247296 | |
| E | 1232020 | |
| A | 127226 | 4.5% |
| O | 127217 | 4.5% |
| D | 46682 | 1.6% |
| F | 44344 | 1.6% |
| N | 15251 | 0.5% |
| C | 6624 | 0.2% |
| U | 6624 | 0.2% |
| Z | 2377 | 0.1% |
| Other values (3) | 47 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2855708 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1247296 | |
| E | 1232020 | |
| A | 127226 | 4.5% |
| O | 127217 | 4.5% |
| D | 46682 | 1.6% |
| F | 44344 | 1.6% |
| N | 15251 | 0.5% |
| C | 6624 | 0.2% |
| U | 6624 | 0.2% |
| Z | 2377 | 0.1% |
| Other values (3) | 47 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2855708 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 1247296 | |
| E | 1232020 | |
| A | 127226 | 4.5% |
| O | 127217 | 4.5% |
| D | 46682 | 1.6% |
| F | 44344 | 1.6% |
| N | 15251 | 0.5% |
| C | 6624 | 0.2% |
| U | 6624 | 0.2% |
| Z | 2377 | 0.1% |
| Other values (3) | 47 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2855708 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 1247296 | |
| E | 1232020 | |
| A | 127226 | 4.5% |
| O | 127217 | 4.5% |
| D | 46682 | 1.6% |
| F | 44344 | 1.6% |
| N | 15251 | 0.5% |
| C | 6624 | 0.2% |
| U | 6624 | 0.2% |
| Z | 2377 | 0.1% |
| Other values (3) | 47 | < 0.1% |
Diameter
Real number (ℝ)
| Distinct | 62 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 116.63386 |
| Minimum | 10 |
|---|---|
| Maximum | 609.59998 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 40 |
| Q1 | 76.199997 |
| median | 110 |
| Q3 | 160 |
| 95-th percentile | 203.2 |
| Maximum | 609.59998 |
| Range | 599.59998 |
| Interquartile range (IQR) | 83.800003 |
Descriptive statistics
| Standard deviation | 57.88525 |
|---|---|
| Coefficient of variation (CV) | 0.49629883 |
| Kurtosis | 4.4144392 |
| Mean | 116.63386 |
| Median Absolute Deviation (MAD) | 47 |
| Skewness | 1.5130728 |
| Sum | 1.6653613 × 108 |
| Variance | 3350.7021 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 110 | 350517 | |
| 63 | 264611 | |
| 90 | 251628 | |
| 160 | 180501 | |
| 200 | 129395 | 9.1% |
| 40 | 51319 | 3.6% |
| 152.3999939 | 29432 | 2.1% |
| 101.5999985 | 24268 | 1.7% |
| 203.1999969 | 20836 | 1.5% |
| 250 | 15951 | 1.1% |
| Other values (52) | 109396 | 7.7% |
| Value | Count | Frequency (%) |
| 10 | 83 | < 0.1% |
| 11 | 31 | < 0.1% |
| 12 | 621 | < 0.1% |
| 12.69999981 | 7 | < 0.1% |
| 13 | 59 | < 0.1% |
| 14 | 29 | < 0.1% |
| 15 | 2078 | |
| 16 | 565 | < 0.1% |
| 18 | 9 | < 0.1% |
| 19 | 1768 |
| Value | Count | Frequency (%) |
| 609.5999756 | 147 | < 0.1% |
| 558.7999878 | 26 | < 0.1% |
| 508 | 1024 | 0.1% |
| 500 | 45 | < 0.1% |
| 457.2000122 | 529 | < 0.1% |
| 406.3999939 | 3530 | |
| 400 | 120 | < 0.1% |
| 355.6000061 | 317 | < 0.1% |
| 355 | 28 | < 0.1% |
| 350 | 82 | < 0.1% |
Length
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 185608 |
|---|---|
| Distinct (%) | 13.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.792207 |
| Minimum | 0 |
|---|---|
| Maximum | 26100.943 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.79900002 |
| Q1 | 3.5 |
| median | 13.598 |
| Q3 | 44.377748 |
| 95-th percentile | 137.495 |
| Maximum | 26100.943 |
| Range | 26100.943 |
| Interquartile range (IQR) | 40.877748 |
Descriptive statistics
| Standard deviation | 78.154167 |
|---|---|
| Coefficient of variation (CV) | 2.1242044 |
| Kurtosis | 17570 |
| Mean | 36.792207 |
| Median Absolute Deviation (MAD) | 12.094 |
| Skewness | 61.117271 |
| Sum | 52533900 |
| Variance | 6108.0737 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 8817 | 0.6% |
| 0.5 | 7786 | 0.5% |
| 2 | 5995 | 0.4% |
| 1.001999974 | 4854 | 0.3% |
| 1.5 | 4172 | 0.3% |
| 1.001000047 | 3024 | 0.2% |
| 1.003000021 | 2869 | 0.2% |
| 0.5009999871 | 2098 | 0.1% |
| 1.200000048 | 1994 | 0.1% |
| 3 | 1989 | 0.1% |
| Other values (185598) | 1384256 |
| Value | Count | Frequency (%) |
| 0 | 4 | < 0.1% |
| 0.004999999888 | 9 | < 0.1% |
| 0.006000000052 | 10 | < 0.1% |
| 0.007000000216 | 9 | < 0.1% |
| 0.00800000038 | 16 | < 0.1% |
| 0.008999999613 | 16 | < 0.1% |
| 0.009999999776 | 43 | |
| 0.01099999994 | 21 | |
| 0.0120000001 | 13 | < 0.1% |
| 0.01300000027 | 21 |
| Value | Count | Frequency (%) |
| 26100.94336 | 1 | |
| 26030.14844 | 1 | |
| 7291.366211 | 1 | |
| 7281.373047 | 1 | |
| 5801.324219 | 1 | |
| 5128.307129 | 1 | |
| 4808.214844 | 1 | |
| 4738.890137 | 1 | |
| 4737.599121 | 1 | |
| 4690.916992 | 1 |
Pressure
Real number (ℝ)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.0660291 |
| Minimum | 0.025 |
|---|---|
| Maximum | 80 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0.025 |
|---|---|
| 5-th percentile | 0.025 |
| Q1 | 0.1 |
| median | 0.15000001 |
| Q3 | 4 |
| 95-th percentile | 16 |
| Maximum | 80 |
| Range | 79.975 |
| Interquartile range (IQR) | 3.9 |
Descriptive statistics
| Standard deviation | 6.9117017 |
|---|---|
| Coefficient of variation (CV) | 2.2542844 |
| Kurtosis | 41.851929 |
| Mean | 3.0660291 |
| Median Absolute Deviation (MAD) | 0.125 |
| Skewness | 5.669332 |
| Sum | 4377842 |
| Variance | 47.771618 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 410666 | |
| 0.150000006 | 362501 | |
| 0.02500000037 | 254694 | |
| 0.1000000015 | 126904 | 8.9% |
| 16 | 81718 | 5.7% |
| 1.700000048 | 63758 | 4.5% |
| 0.400000006 | 56009 | 3.9% |
| 5 | 39695 | 2.8% |
| 49.5 | 10003 | 0.7% |
| 0.05000000075 | 4927 | 0.3% |
| Other values (10) | 16979 | 1.2% |
| Value | Count | Frequency (%) |
| 0.02500000037 | 254694 | |
| 0.05000000075 | 4927 | 0.3% |
| 0.1000000015 | 126904 | 8.9% |
| 0.150000006 | 362501 | |
| 0.400000006 | 56009 | 3.9% |
| 1.700000048 | 63758 | 4.5% |
| 2 | 4405 | 0.3% |
| 4 | 410666 | |
| 5 | 39695 | 2.8% |
| 10 | 1922 | 0.1% |
| Value | Count | Frequency (%) |
| 80 | 1180 | 0.1% |
| 72 | 1408 | 0.1% |
| 59.5 | 1332 | 0.1% |
| 49.5 | 10003 | 0.7% |
| 45 | 2350 | 0.2% |
| 40 | 234 | < 0.1% |
| 36 | 2462 | 0.2% |
| 25 | 197 | < 0.1% |
| 16 | 81718 | |
| 12 | 1489 | 0.1% |
NumConnections
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 64 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.96511478 |
| Minimum | 0 |
|---|---|
| Maximum | 88 |
| Zeros | 885989 |
| Zeros (%) | 62.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 5 |
| Maximum | 88 |
| Range | 88 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 2.1584117 |
|---|---|
| Coefficient of variation (CV) | 2.23643 |
| Kurtosis | 57.135918 |
| Mean | 0.96511478 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.526978 |
| Sum | 1378043 |
| Variance | 4.658741 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 885989 | |
| 1 | 271523 | 19.0% |
| 2 | 111389 | 7.8% |
| 3 | 53893 | 3.8% |
| 4 | 32683 | 2.3% |
| 5 | 20320 | 1.4% |
| 6 | 13963 | 1.0% |
| 7 | 9230 | 0.6% |
| 8 | 7026 | 0.5% |
| 9 | 4887 | 0.3% |
| Other values (54) | 16951 | 1.2% |
| Value | Count | Frequency (%) |
| 0 | 885989 | |
| 1 | 271523 | 19.0% |
| 2 | 111389 | 7.8% |
| 3 | 53893 | 3.8% |
| 4 | 32683 | 2.3% |
| 5 | 20320 | 1.4% |
| 6 | 13963 | 1.0% |
| 7 | 9230 | 0.6% |
| 8 | 7026 | 0.5% |
| 9 | 4887 | 0.3% |
| Value | Count | Frequency (%) |
| 88 | 1 | < 0.1% |
| 83 | 1 | < 0.1% |
| 79 | 1 | < 0.1% |
| 65 | 1 | < 0.1% |
| 63 | 2 | |
| 60 | 3 | |
| 59 | 1 | < 0.1% |
| 58 | 4 | |
| 55 | 2 | |
| 54 | 3 |
NumConnectionsUnder
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.8 MiB |
| 0 | |
|---|---|
| 1 | 418 |
| 2 | 28 |
| 3 | 4 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1427854 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1427403 | |
| 1 | 418 | < 0.1% |
| 2 | 28 | < 0.1% |
| 3 | 4 | < 0.1% |
| 4 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 1427403 | |
| 1 | 418 | < 0.1% |
| 2 | 28 | < 0.1% |
| 3 | 4 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1427403 | |
| 1 | 418 | < 0.1% |
| 2 | 28 | < 0.1% |
| 3 | 4 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1427854 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1427403 | |
| 1 | 418 | < 0.1% |
| 2 | 28 | < 0.1% |
| 3 | 4 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1427854 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1427403 | |
| 1 | 418 | < 0.1% |
| 2 | 28 | < 0.1% |
| 3 | 4 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1427854 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1427403 | |
| 1 | 418 | < 0.1% |
| 2 | 28 | < 0.1% |
| 3 | 4 | < 0.1% |
| 4 | 1 | < 0.1% |
BoolBridle
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.8 MiB |
| 0 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1427854 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1427854 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 1427854 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1427854 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1427854 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1427854 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1427854 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1427854 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1427854 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1427854 |
gas_natural
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.8 MiB |
| 1 | |
|---|---|
| 0 | 56434 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1427854 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 1371420 | |
| 0 | 56434 | 4.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 1371420 | |
| 0 | 56434 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1371420 | |
| 0 | 56434 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1427854 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1371420 | |
| 0 | 56434 | 4.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1427854 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1371420 | |
| 0 | 56434 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1427854 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1371420 | |
| 0 | 56434 | 4.0% |
| PipeId | Inspections | No_Incidents | Risk_S*I/Inspections | leakage_estimate_factor | InspectionYear | MonthsLastRev | Risk_S*I | YearBuilt | Diameter | Length | Pressure | NumConnections | InspectionDay | Severity | Incidence | Province | Material | NumConnectionsUnder | gas_natural | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PipeId | 1.000 | 0.385 | -0.029 | -0.029 | -0.029 | 0.004 | 0.129 | -0.029 | -0.020 | 0.171 | -0.027 | -0.001 | -0.068 | 0.028 | 0.019 | 0.029 | 0.297 | 0.129 | 0.010 | 0.348 |
| Inspections | 0.385 | 1.000 | -0.003 | -0.004 | -0.003 | 0.356 | -0.073 | -0.003 | -0.481 | 0.185 | 0.119 | -0.164 | 0.070 | 0.019 | 0.028 | 0.048 | 0.140 | 0.107 | 0.010 | 0.449 |
| No_Incidents | -0.029 | -0.003 | 1.000 | 1.000 | 0.998 | 0.004 | -0.045 | 1.000 | -0.030 | -0.011 | 0.064 | -0.028 | 0.095 | 0.003 | 0.297 | 0.513 | 0.039 | 0.038 | 0.000 | 0.057 |
| Risk_S*I/Inspections | -0.029 | -0.004 | 1.000 | 1.000 | 0.998 | 0.004 | -0.045 | 1.000 | -0.030 | -0.012 | 0.064 | -0.028 | 0.095 | 0.004 | 0.520 | 0.677 | 0.034 | 0.072 | 0.000 | 0.129 |
| leakage_estimate_factor | -0.029 | -0.003 | 0.998 | 0.998 | 1.000 | 0.005 | -0.044 | 0.998 | -0.030 | -0.011 | 0.064 | -0.028 | 0.096 | 0.002 | 0.276 | 0.451 | 0.009 | 0.043 | 0.000 | 0.101 |
| InspectionYear | 0.004 | 0.356 | 0.004 | 0.004 | 0.005 | 1.000 | 0.006 | 0.004 | -0.029 | 0.016 | 0.131 | -0.060 | 0.079 | 0.020 | 0.034 | 0.055 | 0.072 | 0.049 | 0.000 | 0.071 |
| MonthsLastRev | 0.129 | -0.073 | -0.045 | -0.045 | -0.044 | 0.006 | 1.000 | -0.045 | -0.053 | 0.060 | -0.047 | 0.014 | -0.059 | 0.012 | 0.024 | 0.035 | 0.082 | 0.119 | 0.004 | 0.057 |
| Risk_S*I | -0.029 | -0.003 | 1.000 | 1.000 | 0.998 | 0.004 | -0.045 | 1.000 | -0.030 | -0.011 | 0.064 | -0.028 | 0.095 | 0.003 | 0.371 | 0.633 | 0.030 | 0.057 | 0.000 | 0.064 |
| YearBuilt | -0.020 | -0.481 | -0.030 | -0.030 | -0.030 | -0.029 | -0.053 | -0.030 | 1.000 | -0.202 | 0.001 | 0.255 | -0.037 | 0.023 | 0.049 | 0.080 | 0.161 | 0.159 | 0.006 | 0.224 |
| Diameter | 0.171 | 0.185 | -0.011 | -0.012 | -0.011 | 0.016 | 0.060 | -0.011 | -0.202 | 1.000 | 0.025 | -0.298 | -0.188 | 0.017 | 0.017 | 0.027 | 0.160 | 0.150 | 0.003 | 0.307 |
| Length | -0.027 | 0.119 | 0.064 | 0.064 | 0.064 | 0.131 | -0.047 | 0.064 | 0.001 | 0.025 | 1.000 | 0.062 | 0.521 | 0.002 | 0.000 | 0.000 | 0.008 | 0.006 | 0.017 | 0.000 |
| Pressure | -0.001 | -0.164 | -0.028 | -0.028 | -0.028 | -0.060 | 0.014 | -0.028 | 0.255 | -0.298 | 0.062 | 1.000 | -0.132 | 0.025 | 0.005 | 0.009 | 0.125 | 0.311 | 0.001 | 0.057 |
| NumConnections | -0.068 | 0.070 | 0.095 | 0.095 | 0.096 | 0.079 | -0.059 | 0.095 | -0.037 | -0.188 | 0.521 | -0.132 | 1.000 | 0.002 | 0.030 | 0.051 | 0.015 | 0.014 | 0.033 | 0.054 |
| InspectionDay | 0.028 | 0.019 | 0.003 | 0.004 | 0.002 | 0.020 | 0.012 | 0.003 | 0.023 | 0.017 | 0.002 | 0.025 | 0.002 | 1.000 | 0.002 | 0.003 | 0.104 | 0.018 | 0.003 | 0.015 |
| Severity | 0.019 | 0.028 | 0.297 | 0.520 | 0.276 | 0.034 | 0.024 | 0.371 | 0.049 | 0.017 | 0.000 | 0.005 | 0.030 | 0.002 | 1.000 | 1.000 | 0.028 | 0.074 | 0.000 | 0.062 |
| Incidence | 0.029 | 0.048 | 0.513 | 0.677 | 0.451 | 0.055 | 0.035 | 0.633 | 0.080 | 0.027 | 0.000 | 0.009 | 0.051 | 0.003 | 1.000 | 1.000 | 0.041 | 0.111 | 0.000 | 0.055 |
| Province | 0.297 | 0.140 | 0.039 | 0.034 | 0.009 | 0.072 | 0.082 | 0.030 | 0.161 | 0.160 | 0.008 | 0.125 | 0.015 | 0.104 | 0.028 | 0.041 | 1.000 | 0.094 | 0.011 | 0.234 |
| Material | 0.129 | 0.107 | 0.038 | 0.072 | 0.043 | 0.049 | 0.119 | 0.057 | 0.159 | 0.150 | 0.006 | 0.311 | 0.014 | 0.018 | 0.074 | 0.111 | 0.094 | 1.000 | 0.011 | 0.323 |
| NumConnectionsUnder | 0.010 | 0.010 | 0.000 | 0.000 | 0.000 | 0.000 | 0.004 | 0.000 | 0.006 | 0.003 | 0.017 | 0.001 | 0.033 | 0.003 | 0.000 | 0.000 | 0.011 | 0.011 | 1.000 | 0.001 |
| gas_natural | 0.348 | 0.449 | 0.057 | 0.129 | 0.101 | 0.071 | 0.057 | 0.064 | 0.224 | 0.307 | 0.000 | 0.057 | 0.054 | 0.015 | 0.062 | 0.055 | 0.234 | 0.323 | 0.001 | 1.000 |
| PipeId | Inspections | No_Incidents | Risk_S*I/Inspections | leakage_estimate_factor | InspectionDay | InspectionYear | InspectionDate | MonthsLastRev | Risk_S*I | Severity | Incidence | Province | Town | YearBuilt | Material | Diameter | Length | Pressure | NumConnections | NumConnectionsUnder | BoolBridle | gas_natural | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6345343 | 56922465 | 1 | 0 | 0.0 | 0.0 | Thursday | 2020 | 2020-12-31 | 24 | 0.0 | 4 | 0 | Valencia | Betera | 1993 | PE | 63.000000 | 1.778000 | 4.000 | 0 | 0 | 0 | 1 |
| 534760 | 188341482 | 6 | 0 | 0.0 | 0.0 | Thursday | 2021 | 2020-12-31 | 23 | 0.0 | 4 | 0 | Barcelona | Sabadell | 1995 | PE | 200.000000 | 34.959999 | 0.025 | 0 | 0 | 0 | 1 |
| 4790727 | 189485681 | 6 | 0 | 0.0 | 0.0 | Thursday | 2020 | 2020-12-31 | 23 | 0.0 | 4 | 0 | Valencia | Betera | 1950 | AO | 50.799999 | 16.423000 | 4.000 | 0 | 0 | 0 | 1 |
| 4790765 | 189485654 | 6 | 0 | 0.0 | 0.0 | Thursday | 2020 | 2020-12-31 | 23 | 0.0 | 4 | 0 | Valencia | Betera | 1950 | AO | 50.799999 | 11.443000 | 4.000 | 0 | 0 | 0 | 1 |
| 535324 | 274990283 | 6 | 0 | 0.0 | 0.0 | Thursday | 2021 | 2020-12-31 | 23 | 0.0 | 4 | 0 | Barcelona | Sabadell | 2005 | PE | 160.000000 | 10.377000 | 0.025 | 0 | 0 | 0 | 1 |
| 535302 | 274925411 | 6 | 0 | 0.0 | 0.0 | Thursday | 2021 | 2020-12-31 | 23 | 0.0 | 4 | 0 | Barcelona | Sabadell | 2005 | PE | 200.000000 | 13.497000 | 0.025 | 1 | 0 | 0 | 1 |
| 4794338 | 189538742 | 6 | 0 | 0.0 | 0.0 | Thursday | 2020 | 2020-12-31 | 23 | 0.0 | 4 | 0 | Valencia | Betera | 1950 | AO | 50.799999 | 52.957001 | 4.000 | 0 | 0 | 0 | 1 |
| 534940 | 274990929 | 6 | 0 | 0.0 | 0.0 | Thursday | 2021 | 2020-12-31 | 23 | 0.0 | 4 | 0 | Barcelona | Sabadell | 1995 | PE | 200.000000 | 3.470000 | 0.025 | 0 | 0 | 0 | 1 |
| 534928 | 188341464 | 6 | 0 | 0.0 | 0.0 | Thursday | 2021 | 2020-12-31 | 23 | 0.0 | 4 | 0 | Barcelona | Sabadell | 1995 | PE | 200.000000 | 1.373000 | 0.025 | 0 | 0 | 0 | 1 |
| 534922 | 189215318 | 6 | 0 | 0.0 | 0.0 | Thursday | 2021 | 2020-12-31 | 23 | 0.0 | 4 | 0 | Barcelona | Sabadell | 2000 | PE | 250.000000 | 8.930000 | 0.025 | 0 | 0 | 0 | 1 |
| PipeId | Inspections | No_Incidents | Risk_S*I/Inspections | leakage_estimate_factor | InspectionDay | InspectionYear | InspectionDate | MonthsLastRev | Risk_S*I | Severity | Incidence | Province | Town | YearBuilt | Material | Diameter | Length | Pressure | NumConnections | NumConnectionsUnder | BoolBridle | gas_natural | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81993 | 333980653 | 1 | 0 | 0.0 | 0.0 | Friday | 2010 | 2010-10-08 | 24 | 0.0 | 4 | 0 | Tarragona | Calafell | 2008 | PE | 160.0 | 0.600 | 0.15 | 0 | 0 | 0 | 1 |
| 81992 | 333980637 | 1 | 0 | 0.0 | 0.0 | Friday | 2010 | 2010-10-08 | 24 | 0.0 | 4 | 0 | Tarragona | Calafell | 2008 | PE | 160.0 | 0.601 | 0.15 | 0 | 0 | 0 | 1 |
| 81991 | 190852729 | 1 | 0 | 0.0 | 0.0 | Friday | 2010 | 2010-10-08 | 24 | 0.0 | 4 | 0 | Tarragona | Calafell | 2004 | PE | 90.0 | 0.501 | 0.15 | 0 | 0 | 0 | 1 |
| 54716 | 331062012 | 1 | 0 | 0.0 | 0.0 | Wednesday | 2010 | 2010-10-06 | 24 | 0.0 | 4 | 0 | Tarragona | Calafell | 2008 | PE | 90.0 | 1.000 | 0.15 | 0 | 0 | 0 | 1 |
| 54709 | 333980664 | 1 | 0 | 0.0 | 0.0 | Wednesday | 2010 | 2010-10-06 | 24 | 0.0 | 4 | 0 | Tarragona | Calafell | 2008 | PE | 160.0 | 0.601 | 0.15 | 0 | 0 | 0 | 1 |
| 54218 | 189142507 | 1 | 0 | 0.0 | 0.0 | Wednesday | 2010 | 2010-10-06 | 24 | 0.0 | 4 | 0 | Tarragona | Amposta | 2000 | PE | 110.0 | 0.694 | 0.15 | 0 | 0 | 0 | 1 |
| 45975 | 189141476 | 1 | 0 | 0.0 | 0.0 | Tuesday | 2010 | 2010-10-05 | 24 | 0.0 | 4 | 0 | Tarragona | Calafell | 2000 | PE | 110.0 | 1.188 | 0.15 | 0 | 0 | 0 | 1 |
| 39756 | 324551020 | 1 | 0 | 0.0 | 0.0 | Tuesday | 2010 | 2010-10-05 | 24 | 0.0 | 4 | 0 | Barcelona | Sentmenat | 2008 | PE | 110.0 | 0.802 | 0.10 | 0 | 0 | 0 | 1 |
| 39473 | 190908195 | 1 | 0 | 0.0 | 0.0 | Tuesday | 2010 | 2010-10-05 | 24 | 0.0 | 4 | 0 | Alicante | Alicante/Alacant | 2004 | PE | 200.0 | 0.999 | 0.15 | 0 | 0 | 0 | 1 |
| 2816 | 340613298 | 1 | 0 | 0.0 | 0.0 | Friday | 2010 | 2010-10-01 | 21 | 0.0 | 4 | 0 | Tarragona | Calafell | 2009 | PE | 90.0 | 1.101 | 0.15 | 0 | 0 | 0 | 1 |